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O ■ Abstract: In this paper the necessary conditions of optimality in the form of 

• maximum principle are derived for a very general class of variational problems. 

This class includes problems with any optimization criteria and constraints that 
can be constructed by combining some basic types (differential equation, integral 
equations, algebraic equation, differential equations with delays, etc). For each 
J> I problem from this class the necessary optimality conditions are produced by con- 

^ I structing its Lagrange function R and then by dividing its variables into three 

groups denoted as u{t), x{t) and a correspondingly, a are parameters which are 
constant over time. The conditions of optimality state that a non-zero vector func- 
tion of Lagrange multipliers exists such that on the optimal solution function R 
^ I attains maximum on u, is stationary on x, and the integral of R over the control 

period S can't be improved locally. Similar conditions are also obtained for sliding 
^ , regimes. Here solution is given by the limit of maximizing sequence on which the 

^ I variables of the second group are switching with infinite frequency between some 

(basic) values. 



1. Introduction 

The first formulation of the Pontryagin maximum principle was published in [T] 
for a problem of optimal high-speed response. The maximum principle was then 
extended to various other classes of variational and optimal optimal control prob- 
lems, see for example [2-5]. For each of these new classes the new proof of the 
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maximum principle was derived. Most of these proofs used Rozonoer approach 
[2], who pubhshed the first proper proof of the maximum principle based on using 
needle-like variations of control. The same proof was later used in Pontryagin's 
book [6], where the needle-shaped variations were called the Makstein variations. 
Pontryagin did not mention neither Boltyanskii's [1] nor Rozonoer's proofs, and 
did not acknowledge the crucial contribution of A. A. Fel'dbaum, who first stated 
the standard modern optimal control problem. 

V.M. Tikhomirov repeatedly emphasized that it is desirable to consider vari- 
ous forms of optimal control problems within a unified framework by stating their 
optimality conditions in terms of the Lagrange function. This approach was im- 
plemented in [7] for the problems described by the ordinary differential equations. 

In practice, the optimal control problems may also have many other types of 
constraints - integral equations, finite constraints, etc. The exact composition of 
these constraints may be very different in different parts of the control interval, 
etc. 

In this paper we will use the canonical form of a variational problem [8, 9] 
that includes a wide class of constraints and objective functions. We will prove 
the maximum principle for the problem in this form. The known proofs of the 
maximal principle rely heavily on the link between the problem's state (phase) and 
its control variables via the problem's differential constraints. The approach used 
in this paper is based on using the particular type of relaxation of the canonical 
extremal problem. Similar approach was applied by L.S. Yang, V.F. Krotov, V.I. 
Gurman, R.V. Gamkrelidze, and others [10-13] to obtaine bounds (estimates) on 
the solutions of the particular types of extremal problems, to investigate sliding 
regimes and to derive the sufficient conditions of optimality. 

When the particular problem from this class, with the particular type of con- 
straints and the particular type of the optimality criterion, is considered, then this 
problem is reduced it to the canonical form. This yields the problem's Lagrange 
function R and defines which of its variables will belong to the first group and 
should maximize R on the optimal solution. Thus, adding new constraint not only 
changes (adds a new term to) the problem's Lagrange function, but it also changes 
for which variables it is required to maximize it. 
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2. Equivalent transformations and relaxations of 
extremal problems 



An extremal problem is defined as a problem of finding the maximum of some 
criterion I{y) on the feasible set D 



D can be a set in a vector space i?" or in a space of functions. The functional 
I{y) is defined and bounded from above in D. The element y* £ D, for which I 
attains maximum is called the optimal solution, and the value of criterion I{y*) 
is called the value of the problem. The set D can belong to the vector space -R" 
or to the space of functions. If the solution does not exist, then the value of the 
problem is the exact upper bound of the criterion (sup/(y)) in D. In this case, the 
maximizing sequence is called the generalized solution. 

2.1. Equivalent transformations 

An extremal problem ([1]) can be transformed into another extremal problem. If 
the solutions of both the original and transformed problems are the same then the 
problems are called equivalent with respect to solutions. In their values coincide 
then they are called equivalent with respect to values. If both solutions and values 
are the same then the problems are said to be equivalent. Let us give a few examples 
of such transformations. 

(1) Applying monotone function to criterion transforms ([T|) into the following 
problem 



here Fq is a monotonically increasing function (if Fq is differentiable then its deriva- 
tive on / is positive almost everywhere). The problem ^ is equivalent to ([T]) with 
respect to the solution. 

(2) Adding vanishing term to the optimality criterion 



(1) 




(2) 



max / y G D 



(3) 



max / y G D 
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where the function ip{y) is equal zero for any y £ D. The problem JS]) is equivalent 
to the original one. 

The concept of equivalent extremal problems can be generalized for problems 
where feasible sets of the original and transformed problems are different. Consider 
two extremal problems: 

problem A : lA{y) max / y G Da', 

and problem Al : /ai(-z) max / z G Dai- 

Definition 1. The problems A and Al are equivalent with respect to solution if 
such one-to-one mapping between Da and Dai can be found that from 

(4) lA{yi) > lA{y2), (yi,?/2) g Da 
it follows that 

(5) Iai{zi) > lAiiz2), {zi,Z2) G Dai- 

Here zi is mapped onto yi, and Z2 to ?/2- We define the class of equivalent problems 
A as a set of all problem that are equivalent to A. The inequalities (|4|, ([5]) guarantee 
that the optimal solution of A corresponds to the optimal solution of Al. 
Example. Suppose the problem A and Al have the following forms 

fo{y) max / fo{y) > 0, a<y <b 
b 

J ./Mr)6{t - T)dT ^ max / /o(r) > 0. 

a 

here function /o is continuous and bounded on [a,b]. After introducing mapping 
of solutions of the problem A onto solutions of the problem Al 6{y^ — r) (where 
6 is the Dirac delta function), then Al £ A. 

2.2. Relaxation of extremal problem 

The method we use in this paper to solve extremal problems includes transforma- 
tion of the original problem into another problem with the same criterion and a 
larger feasible set [10, 11]. The simplest transformation of this type can be ob- 
tained by deleting one of the original problem's constraints. It is clear that the 
new problem will have a wider feasible set. The other transformations that can 
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be used here include adding time-dependent points to the feasible set of the orig- 
inal problem that includes only time-independent points, allowing discontinuous 
solution in addition to smooth ones, etc. All these methods transform the original 
problem into its relaxation. 

Since the original extremal problem A corresponds to the entire class A of the 
equivalent transformed problems, we shall call any member of this class a relaxation 
of A (unlike the conventional definition of relaxation 

Definition 1: The problem B (Isiy) ™ax / y E D^) is a relaxation of the 
problem A (lAiu) — > max / y € Da), if it is possible to single out such subset Db 
of Db that the problem B (Isiu) max / y G Db ) is equivalent to the problem A 
with respect to the solution. Therefore the problem B belongs to the class A. 

Note that feasible sets Da and Db can have different nature. For example one 
can be a vector space and another a space of real functions. In the general case Ia 
and Ib can be also different but they obey the inequalities Jl]), ([5|) on Da and Db- 
In particular if Da coincides with Db then I a and Ib are either the same or one 
a is monotonic function of another on Da- 

Sometimes criteria and constraints that determine D depend on a parameter 
A in such a way that for any A G Fx the problem Bx is a relaxation of problem 
A. We shall call such relaxation Bx{ lB{\,y) max / y £ Db\) a parametric 
relaxation- 
Consider problem A: lA{y) — max € Da where a finite set of constraints 
determines Da- Suppose for each of these constraints the norm Aj can be defined 
that measures its deviation from some nominal value (that is, from its value for 
the original problem). 

Definition 2: Problem is said to be well-posed if for any e > 0, 6 exists such 

that from inequality max(Aj) < 6 it follows that the absolute value of the deviation 
j 

of I\ for problem with constraints deviating from nominal values by 6j from the 
problem's value when all constraints have zero nominal values Sj = is less than 
e. 

Definition 3: Relaxation B: lB{y) — *■ max j y £ Db of the problem A is equiv- 
alent if 

(6) /■j= sup I^{y) = sup lB{y) = II- 

Note that the left hand side of this equality does not depend on I* and D^ , it 
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depends on II and -D_, that is, on the optimahty criterion and feasible set for any 
problem from the class of equivalent relaxations of A. 

Parametric relaxation is equivalent if equation ([6|) holds for at least one X EV\. 

The following statements follow from these definitions: 

Lemma 1: The sufficient condition for relaxation to be equivalent to the original 
problem is that for any solution of relaxation problem G Db it is possible to find 
a sequence {yi} C Da of the feasible solutions of the original problem such that 

(7) lim%(y,) = /B(yO). 

I— >oo 

For well-posed problems it is not necessary for {yi} to belong to D^. It is only 
necessary that in the limit i ^ oo each of the constraints tends to nominal value 
with arbitrary accuracy. Lemma 1 follows from the definition 3. If optimal solution 
y* of relaxation problem exists then y^ can be replaced with y* . 

Lemma 2: If y^ is the optimal solution of the problem A, B is an equivalent 
relaxation of A and Db D Da, then the necessary conditions of optimality for 
relaxation problem hold for y*A. 

Lemma 2 follows from the fact that y*A can not be improved on Da, and Da is 
a subset of the feasible set of the relaxation problem. 

Relaxations are used 

1) to reduce a conditional optimization problem to an unconditional problem, 

2) to find approximate solution in a class of maximizing sequences if the problem 
does not have a solution in L), 

3) to derive conditions of optimality and bounds on problem's value, 

4) to construct computational algorithms. 

The most widely used type of relaxation uses criteria Ia and Ib which have 
the same values on any element of Da and on element that corresponds to it in 
Db- If Da and Db are defined on the same space then Da coincides with Db and 
Vy G Da lA{y) = Isiy)- For this type of relaxation the sufficient conditions of 
optimality (Krotov lemma [10]) holds. It states that the sufficient condition for y* 
to be the solution of A is that y* is a solution of relaxed problem B and y* belongs 
to Da- 

If the initial problem has no solution, then its is sufficient for a sequence of 
feasible solutions of the initial problem to be problem's generalized solution (maxi- 
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mizing sequence) if this sequence approximates the solution of the relaxed problem 
with arbitrary accuracy. 



3. Canonical form of the variational problem and 
optimality conditions for sliding regimes 

3.1. Variational problem in the canonical form 

We shall call canonical the following variational problem: 



1 

(8) /= /'[/oi(i,x(t),n(t),a)+^/o2(t,x(t),a)5(t-tO 
I 



dt — > max 



subject the following conditions 

T 

(9) Jj(r) = j x{t),u{t),a, r) + /j2(t, x{t), a, r)<5(t - r)] dt = 0, 



VTe[0,r], j = l,...,m, u€Vu, a€Va, 

where a is vector of parameters that are constant on [0,T]; u{t) and x{t) are the 
piecewise continuous and the piecewise linear vector functions; values of u{t) belong 
to the closed bounded set V in the space R""; functions fji and fj2, j = 0, . . . ,m 
are defined on direct product of the feasible sets of its two arguments, they are 
both continuously differentiable on x, a and t and fji are continuous on u. 
u{t) denote variables that are among the lists of parameters of the functions fji 
for j = 0, l,...,m only (that is, fj2 are independent on u{t)). We shall them the 
variables of the first group. 

Lemma 3: Suppose the problem ((8|), ([1]) is well-posed with respect to problem's 
value ( according to with definition 2, where the norm of deviation from the nominal 
value of every constraint ^ is defined as Aj = max|Jj(r)|J, then the average 
relaxation of this problem 



± 

(10) 1 = j [fm{t,x,u,af + fo2{t, x, a)6{t - U) 



dt — > max 
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subject to 



(11) Jjir) = J ^fji{t,x,u,a,T) + fj2{t,x,a,T)5{t - t) 



VTG[0,r], j = l,...,m, ueVx, aeVa 
is equivalent to the problem 



dt = 



Here 
(12) 



IjT = J fji{t,x,u,a,T)P{u,t)du. 



Vu 



The probability density measure P{u, t) obeys the condition 
(13) P(n,t)>0, j P{u,t)du = l VtG[0,r]. 

Vu 

This relaxation is obtained by replacing the control variable u{t) with its mean 
value, that is, by applying randomization operation to u{t). It was used by Yang 
[12] to study problems with differential constrains. It was also used by Krotov and 
Gamkrelidze to study sliding regimes in optimal control problems. 
The proof of the Lemma 3 is given in the Appendix. 



3.2. Optimality condition for sliding regimes 

The unknown variables in the problem (fTO]) - (fTT]) is the measure P*{u, t), the vector 
function x*{t), and the vector a* . The solution of this problem obeys the following 
condition. 

Theorem 1 (optimality conditions for sliding regimes for the variational 
problem in the canonical form). Suppose P*{u,t),x*{t),a* is the solution of 
the problem ^^-m\). Then 

1. The optimal distribution of the randomized variables has the following form 

m 

(14) P*iu,t) = Y,j,mn-u''it)), 

where for all Vt G [0, T] the piecewise continuous functions ^v{t) obey the condition 

m 

l.it) > 0, ^ 7.(0 = 1. 

v=0 
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2. We can find a scalar Aq > and a continuous vector function A(r) = (Ai(t), \m{ 
which are not equal zero simultaneously with Aq on the interval [0, T] and are equal 
zero outside it, such that for the functional 



T T 

m 



Rdt 



(15) S = XoI + Y.J ^i{r)J,{T)dT = J 

J=l 
and its integrand 

m 

(16) R = \oRo + Y.Rj, 

m 

Ro = Y. 7^(i)/oi(t, x(t), u^(t), a) + J2 Mt, xit),a)6{t - ti), 



u=0 

T 



(17) R, = J A,(t) 





lv{t)fji{t, x{t),u'',a, r) + fj2{t, x{t), a, t)6{t - t) 



dr 



.u=0 

the following conditions hold 

(18) ^5a < 0, 

da 

, . bR 

(20) n'^(t) = argmaxi?(x, A, a*, n), ;/ = 0, ...,m, 
where ba is an admissible variation of the parameter a. 

u'^{t) are called the basic values of the vector function u. Some of the conditions 
of the problem (fTO]) - (fTT]) may not contain variables of the first group; in this case, 
the maximal number of basic values is less than m + 1. The proof of the theorem 
is given in the Appendix. 

Note that the relaxation (fTO]) - (fTT]) is equivalent to the original problem jS]), 
Therefore from Lemma 2 it follows that if the optimal solution (u*{t), 
x*{t), a*) of the initial problem exists in the class of piecewise continuous functions 
u(t), then it satisfies the optimality conditions (fT8]) - ([20]) . In this case 70 (i) = 1 
and the remaining multipliers 7j(t) in are equal to zero. 

The optimality conditions of the problem ([8]), ([9]) takes the form ([T8]) - (fT9]) with 

(21) M*(t) = argmaxi?(x, A, a*, u), 

ueVu 
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where 



(22) 



Ro = /oi(i, x{t),u{t), a)+Y, Mt, x{t), a)S{t - ti) 



T 



(23) 



/ 



Aj(T) fji{t,x{t),u{t),a,T) + fj2{t,x{t),a,T)S{T -t) dr. 







Thus, for the variational problem in the canonical form ([8]), ^ the maxi- 
mum principle conditions (fT8]l . (fT9]) . (f2T]) hold, where the function R includes 
contributions from the optimality citerion Rq and from each of the constraints 
Rj,j = 1,2, ...,m. 

4. Maximum principle for variational problems 
4.1. Necessary conditions of optimality for sliding regimes 

The theorem 1 ( (fTSl) . (fT9]l . (f2T]) ) allows us to obtain the conditions of optimality 
in the form of maximum principle for problem with various types of criterion and 
constraints. This is done in two steps. Firstly, the problem under consideration is 
reduced to the canonical form as we it was done in the previous section, yielding 
i?0 term for criterion and Rj terms for of its constraints. Then the variables of the 
first group with respect to every one of these terms are singled out. The variables 
which belong to the first group with respect to all of these terms are denoted as 
u(t). The maximum principle with respect to these variables ([2T]) holds. 

This process can be simplified if we derive contributions to Rq and Rj from the 
standard types of optimization constraints and optimality criteria together with 
the rules showing how to classify problem's variables for each such contribution. 
These contributions and rules are shown in Tables 1 and 2. 

We illustrate the derivation of such contributions to R for problem with con- 
straints in the form of the ordinary differential equation 



It can be rewritten using the 5 function and the Heaviside step function h{t) in the 



T 
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Table 1. 



Optimality criteria and the contibution to Lagrange function R 

from them 





Optimality criterion 
I max 


Term Rq 


Type of 
term 


1 


T 

J fa{y{t),a,t)dt 




^afo{y{t),a,t) 


Roi 


2 




XoFo{y{t),a,t)d{t-to) 


Ran 


3 


1= min /o(y(t),a,t) 
te[o,T] 


^ + Xit)[I-fo{y{t),a,t)] 
X{t) < 0, 

Xm* ~fo{y{t),a,t)]^0. 


Ron 



Note: If / = ^ flfc/fc, then i?o = ^ akRok- 

k k 

Table 2. 



Basic types of constraints and their contributions to R 



]V2 


Kind 

of constraint 


Term Rj 


Type 
of term. 


1 


T 

j f{y{t),a,t)dt^Q 




Xf{y{t),a,t) at t G (0,T) 
at t^(0,T) 


Rj 


2 


/(y(t),a,t) = 0, 
Mt G (0, T) 


A(t)/(y(t),a,t) at t G (0,T) 
at t^(0,T) 


R, 


3 


/(y(to),a,to) = 


A/(y(t),a,05(t-to) 


Rjii 


4 


i = f{x{t),u{t),a,t) 
at < E [0, T] 


V'W/(x(i),w(i),a,t), 
V'(t) = at t ^ [0, T] 

+ (x(0)/T)V'(0) 


Rji 
Rjii 


5 


t 

a;(0 = J f{x{T),u{T),T)dT + Xq 


t 

f{x,u,t) J A(r)dr 


Rji 







T 

A(f) = at i ^ [0,T] 
A(i)(a;(t)-x(0)) 


Rjii 


6 


T 

x{t)^ j f{x{T),u{T),a,T,t)dT 


T 

y" A(T)/(x-(t),u(t),a,i,T)dT 


Rji 









-A(Oa;(0 


Rjn 
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following form 



J(r) 



x(t)6{T -t)- f{x{t),u(t), t)h{T - t) 



t 



dt = 0, re[0,t]. 



From (fTTll it follows that Rj for this constraint has the form 



A(r) 



x{t)X{t) 



xmr -t) - f{x,u,t){h{T -t) 
t t 

f{x,U,t) J X{T)dT-^ J \{T)dT. 



dr 



Let us define the function such that 



m = x{t), m = 0, / xiT)dr = -m 



J A(r)d7 



We get 
(24) 



Rj = ipx + ipfix, u, t) + V'(O) 



Xq 



If the interval [0,t] and xq are fixed, the last term in ([24l) has no effect on the 
optimality condition. 

Other summands in the Tables 1 and 2 are obtained in the similar way. If 
necessity these tables can be extended for other types of constraints. 

Suppose we consider the problem of miximization of one of the optimization 
criterion listed in Tabvle 1. subject to any combination of constraints from Ta- 
ble 2. We classify unknown variables of the problem as the variables of the first 
group is they are among the parameters of the functions Rqi and i?oj for every 
optimality criterion and every constraint. Thus the optimality conditions for the 
particular problem of maximization of the criterion / shown in Table 1 subject 
to any combination of constraints from Table 2 can be obtained by summing up 
contributions to R from the corresponding terms in these two tables, and then by 
dividing problem's uknown variables into two groups using the following rule: the 
problem's variable is classified as the one from the first group if Rqi and Rjj for 
the optimality criterion I and for each of the constraints depend on this variable. 
All the other unknowns are classified as belonging to the second group. 
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We will denote the variables of the first group by u{t) and the variables of the 
second group by x{t). The problem may have no variables of the first group if, for 
example, all the variables are linked to each other via the finite equation (row 3, 
Table 2). 

Using the Tables 1 and 2 for each particular problem, we construct the function 
R as 

n 

(25) R = XoRo + Ru- 

u=l 

We denote all terms in ([25]l that depend on u{t) as H . We denote the rest of this 
expression as N . Thus, 

R = N{x, A, t) + H{x, u, A, t). 
Then the Lagrange function for the extended problem becomes: 

m 

R = N{x, \,t) + Yl lk{t)H{x, Uk, A, t), 

fc=0 

where m is the total number of the conditions of the problem, which contain u{t), 
m < n, while the conditions 7fc(t) satisfy the relations 

m 

(26) lkit)>0, ^7fc(t) = l. 

k=0 

The optimal solution of this problem in the class of sliding regimes is defined as 
such functions 7*(t) with components ^l{t) and u*{t) with components u^t), and 
also x*{t), such that u^. G Iki^) satisfies ^26\) . while the vector function x*{t) 
for any t G [0,t] can be approximated as exactly as desired by the sequence {xr{t)} 
of admissible solutions (by constraint equations) of the problem such that in this 
solution the functional I tends to its upper bound. The measure of closeness of the 
functions x*{t) and Xr{t) is the value that is maximum on t of the absolute value 
of their difference. 

The necessary optimality conditions in the class of sliding modes are given by 
the following 

Statement (corollary from the theorem 1). //7fc(i)) '"fc(*) {k = 0, . . . ,m), x*{t) 
is the solution of the problem of maximization of the functional I, listed in the 
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Table 1, over the set of admissible solutions, which is defined by the conditions 
from the Table 2 in the class of sliding regimes, then there exists a vector function 
A = (Ao, Ai(t), . . . , Xm{t)); Ao = (0; 1), that is not equal zero for t G [0,t] and is 
equal zero outside of [0,t]. For this vector function for almost all t G [Q,t\,and 
lk{t) > the function 

(27) H{x*,ul,X,t) = max H{x*,u,X,t), 

attains its global maximum on u at u%, and the extended function R is stationary 
on x: 

dN{x,X,t) ^ dH{x,Uk,X,t) 
(^^^ dx = - d-x ■ 

fc=0 

// the problem also depends on the vector of parameters a, then the conditions of 
optimality include equations for finding the optimal value a* from the condition that 
the functional S can not be locally improved with respect to a: 




where 5a is a feasible variation of a. 

All the functions that enter definitions of constraints of the problem must be 
continuous on u and continuous differentiable on x and a. For the existence of the 
maximum in ([27l) . it is sufficient that the set V be closed and bounded, and that 
the function H be bounded by u. 

4.2. Maximum principle for problems with the scalar 
argument 

In H has maximum on u for almost all i at a single point (70 (t) = 1), then the 
problem has the solution u*{t) in the form of the piece- wise continuous function. 
From ([271) ■ ([29]) it follows that there exists a vector function A = (Aq, • • • , Am(i)); 
Ao = (0; 1) that is not equal zero on t G [0,i] and is equal zero outside of this inter- 
val. This vector function is such that at almost all t G [0, t], the following conditions 
holds the maximum principle for problems with the scalar argument: 

(30) H{x* ,u* ,t, A, a*) = maxH{x* , u, t, A, a*), 
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(31) 
(32) 



dN 
dx 



dH 

dx ' 



dS_ 

da 



6a 



d_ 

da 



j [N{x, A, a, t) + H{x, u, a, A, t)] dt 





5a < 0. 



4.3. Examples 

Pontryagin maximum principle. As one of the examples of obtaining the 
necessary conditions of optimality in the form of (f30]l — (f32l) . we will consider the 
problem 



(33) 



j fQ{x,u,t)dt + Fo{x{t)) — > max, 





Xv = fu{x,u,t), Xy{Q)=XM, u = l,...,m, u£V, 

in which all constraints are in the form of differential equations. 
The Lagrange function here becomes 

m m 

(34) R = XoRo + Y,Ru = Ao/o + J2^4>ufu + ^yXu) + XoFo{x)5{t - t). 

u=l v=l 

Note that the variables u here enter only terms Ron and Rju. Hence, u (controls) 
here belong to the variables of the first group. There are no parameters in the 
problem ([33]l and the optimality conditions (assuming that solution exists in the 
class of functions x{t) that is differentiable for almost all t and functions u{t) that 
are piece wise continuous for almost all t) take the form 

dR 



(35) 



ti*(t) = argmaxi?(A, u, x*), „ 

U&V dXy 



0, i/ = l, 



This condition, after taking into account (|34ll . can be reduced to the following 
conditions 



(36) 



u*{t) = argmaxff('0, n, x*), 

u&V 

■ dH dFo 

ipu = —- — = Aot; — 0[t — t), u = I, . . . ,m. 



dxu dxy 

Here, the function H (Hamilton function) is the sum of all terms in R that depend 
on u: 

m 

H = Ao/o + E V'./.- 



15 



If we take into account that outside of the interval [0,t] ip = 0, then from (f36ll 
it follows that ipuit) has a break at t and 

OF 

(37) i/j^(t) = Xo^ — , u = l,...,m. 

The conditions ([36I1 . (f37| are to be solved together with the differential equations 
([33]) and the boundary conditions for x. 

The proposed approach to obtaining the necessary conditions of optimality 
allows us to trace how these conditions change when we add new constraints to the 
problem. For example, suppose we add to the problem ([33]) the condition 

(38) F{x(^)) = 0. 
This adds the following term to R 

Rj = XF{x)6{t-t). 

The optimality conditions ([36]) )do not change for t <t. For t = t the ijjy is now 
equal to 

dFn ~ dF 

(39) "^^^^^ "^^aT + ^aT' ^ = 1'---'"^- 



The additional variable A is to be found from the condition ([38]) . 

Fq in ([33]) and F in ([38]) can also depend on controls. In this case the conditions 

of optimality do not change for t < t. But the control u(t) now obeys weaker 

d 

condition of local optimality — [Aq-Po + XF]5u < 0, because it turned out that 

ou 

control here belongs to the variables of the second type for t. 

Butkovskii optimality conditions for the problem with constraints in the 

form of integral equations[3]. For the problem 

t I t 

(40) / = / /o(x, n, t)dt ^ max / / /(x(r), n(r), T)dt - x{t) = 

J U&Vu / J 

' 

the Lagrange function (using Tables 1 and 2) has the form 

t 

R = XoUx,u,t) + J XiT)f{x{t),u{t),T,t)dT - X{t)xit). 
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Here u{t) is the variable of the first group. The optimahty conditions will take the 
form 



dx dx 



A0/0 + j HT)f{x,U,T,t)dT 




(41) 



u 



are max 



i 

A0/0 + J HT)f{x,U,T, t)dT 



Combination of differential and integral equations. In many cases, it is 
more convenient to describe a linear object by a convolution equation rather than 
by a differential equation. We will show how the replacement of the differential 
equation (for definiteness, the equation for the m-th differential constraint) by the 
convolution equation of the form 



u{T)k{t - T)dT - x{t) = 0. 



will effect the necessary conditions of optimality (the maximum principle). Here, 

k{t) is an impulse transfer function. 

t 

Instead of the term Um{t) j \{T)k{T — t)dT the function H now includes the 



term ipmfm- Similarly the function now includes {—XmXm) instead of {ipmXm)- 
The division of the problem's variables between the first and the second groups do 
not change. The maximum principle here follow directly follow from (I30p -( p2]l and 
do not require any special derivation. 

Problems with the conditions in the form of inequalities and with 
the maximin criterion. Some of the conditions of a problem may have the form 
of inequalities. To obtain the optimality conditions using the proposed approach 
these inequalities can be rewritten in the form of equalities using additional artificial 
variables. For example, the inequality 



(42) 



f{y{t),t)>o 



can be rewritten as the equality 

f{y{t),t)-z{t) = 0, 
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using additional artificial nonnegative variable z(t). The appropriate term in R 
has the form 

R, = \{t)f{y,t)-X{t)z{t). 

The variable z{t) belongs to the second group and does not enter into other 
terms in R, except R^. The conditions of local optimality of i? by z (after taking 
into account that Z is nonnegative and its feasible variation 5z > 0) yields 

^Sz > ^ ^ > ^ A(t) > 0. 
oz oz 

Here X{t) = if z{t) > 0, that is, if f{y,t) > 0, and X{t) > if f{y,t) = 0. This is 
an exact analog of the condition of complementary slackness in the mathematical 
programming. 

For the maximin problem 

(43) / = min My{t),t) max. 

te[o,t\ 

we can use the same method by adding an additional parameter a to the problem 
which is independent of t. Then the problem can be rewritten as 

(44) a — > max 
subject to inequality that holds for any t G [0,t], 

(45) fo{y{t),t)-a>0. 

The criterion (|44| and condition ([451) contribute the following terms to the 
function R 

R = Xo^ + X{t)fo-X{t)a, 
where (similar to (H2]l X{t) > 0, and 

X{t)[fo{t,y*{t))-a*]=0. 

Here, Aq = 1 if a non-degenerate solution exists. Otherwise Aq = 0. Because all 
terms in R (except from R) do not depend on a and because a is unconstrainted, 
the condition of stationarity of the Lagrange functional 5" on a yields 
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APPENDIX 



Validity of the Lemma 3 follows from the Lemma 1 and the fact that for any 
solution P°(w,t), x'^{t), a° we can find the sequence of points 



Indeed, for any given t,x,a,T, the vector / = (/q, /j^, /„) belong to the convex hull 
Q of the set Q which is obtained by mapping of Vu onto (m + l)-dimensional space /. 
The solution maximizes /o with respect to u. Therefore, it belongs to the upper bound 
Q and can be obtained as a linear combination of no more than (m + 1) elements of Q 
(Carathiiodory's theorem). 

For any solution -P°(m, t) that has the form (fT3| . it is possible to construct the sequence 
{ui{t)} of solutions of the problem ([HI, ([9]) by dividing the interval [0, r] into i subintervals 
Ai,...,Ai and assuming that 7j/(t) and u'^{t) are constant on each of these intervals. 
Suppose their values are denoted as j„r and (i^ = 0, . . . , m) correspondingly. We shall 
call the problem obtained in this way the discretization of the problem (fTO|) , (fTTj) . 

We divide the interval [0, r] in the problem (lU, Q in a similar way. But here we divide 
each subinterval into (m + 1) smaller intervals. Thus, we divide into A^o, A^i, A^™, 
and we get A^iy/Ar = j„r- We assume that the variables u{t) in the problem ^ are 
piece- wise constant and are equat to u'^ on the interval A^, . For the solutions constructed 
in this way on each of the intervals A,.(r = 1, ...,i), the values of the functional / and 
J(t) in the problem ([8]), ^ are equal to the values of the corresponding functional for 
the discretization problem (flOl) , (fTTj) . If i — > oo then Ar tend to zero uniformly in r and 
Id and Jd{t) for the discretization of the averaged problem become arbitrary close /(r) 
and J{t). Because the problem under consideration in well-posed (definition 2) the same 
is also true for / and J(t). The Lemma 3 is proved. 

For the proof of the theorem we use the following statement: 

Suppose y*{t) is the solution of the following problem 



{zj = {«.(<), a;°(0,a°}, 



on which 



I{zi)~^I*, Jj{T,Zi) —> Jj{T) ^Q, i^oo,j = l,m. 



T 



(47) 








subject to constraints 



T 



(48) 
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where f is continuous and continuously differentiable with respect to all arguments. Then 
a non-zero vector 

A= (Ao,Ai(t),...,A™(t)), Ao>0, 
can be found such that for y ~ y* the following inequality holds 

(49) (f)^^^O' 
where 

T 

m m „ 

3=1 3=1 i 

and where Sy is the feasible variation of y{t) with respect to the condition y £ Vy{t). 

Proof. For simplicity we assume that m = 1. After expanding /o and /i near y*{t) 
and neglecting higher than linear terms we get: 

fo{y,t) = fo{y*,t) + {dfo/dy)Sy, fi{Y*,t) + {df,/dy)5y. 

Suppose the problem is non degenerate. That is, that y*{t) is not an extremal of Ji(t) 
for any r e [0, T]. Then such value t = ti(T) can be found that 

(50) {dfi{y*M,T)dy)^Q. 

Consider the variation of the solution 5y which differs from zero only over two infini- 
tizemal time intervals e* around ti(r) and the arbitrary t2 S [0,T]. We denote 

(Ji = 5y{t)dt, (72 = 5y{t)dt. 

Variations of / and Ji (r) take the form 

51 = idfo/dy)t,cT, + {dfo/dy)t, CJ2, 5Ji{t) = {df - l/dy)t,<Ji + {dfi/dy)t,a2. 
Since the latter expression is equal zero for arbitrary t e [0, T], we can rewrite it as 

T T T 

(51) / G{T)6J,{T)dT = a, [{dT,/dy)tMr)dT + <J2 [ {dfo/dy)tMr)dT = 



for arbitrary function G{t), which obeys the conditions G{t) > 0, J G{T)dT = 1, and 



the variation of the functional / over the set of variations Sy, feasible with respect of 
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conditions l(5T|l . must non-positive: 



SI = (72 



1 

{dfo/dy)t, - {dfo/dy)t, J {dh/dy)t,G{r)d7 



{dh/dy)tMr)dT 



< 0. 



From l|50p it follows that the denominator of the fraction in the square bracket is non zero. 
Denoting 



1 / g 1 

and taking into account that t2 can have an arbitrary value, we get 

T 

Uy,t)+ f X{T)h{y,t,T)dT Sy<0. 



dfi 



(52) 



d_ 

dy 



If the condition l(50|) does not hold then a non-zero function Ai(t) can be found such that 

T 

\l{T)h{y,t,T)dT =0. 



(53) 



d_ 

dy 



Combining l(52|) and l|53p yields the condition of optimality ijiO)) , where for a non-degenerate 
problem we can set Ao = 1. 

The problem HO]), (HI]) for the distribution P{u,t) in the form Hi]) has the form l(47|) . 
(|48|. with 

i? = i? — i?„i+l = Aq-Ro + Rj — Rm+1, 

j 

where Ra and Rj have the form ifTT)) . and the term i?m+i corresponds to the condition 

m 

^7.(0-1 = Vte[0,T], 

1^=0 

which can be rewritten in the form l(48|) as 

T 

>/m+l(T) = 



/ ~ ^) '^^^ " ^^'^^ = Vr e [0, T] 



Thus, 



= j Xm+iir) - '^(* - ^)^^ = A™+i(t) - 1^ 
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From the conditions (|49| for 

dR , 

7^57^ < 0, 7^ > 
it follows that for the basic values u^^t) (where 7iy(i) > 0) 

R{x,X,a* ^u") = A„i+i(i), v = {),... ,m, 

and for u ^ u'^{t), 7i/(t) — and S^^ > 0, and, hence, A, a*, u) < Am+i(<). Therefore 
the maximum condition (|20|) holds. 

The condition lfT9|) follows from (|49|) if we take into account that x is unconstrained. 
The conditions ifTS)) follow from the fact that with respect to the vector of parameters a, 
the problem (flOl) . fTTjl is a nonhnear programming problem and S is its Lagrange function. 
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